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1 How one can measure subsets in the free group? 
1.1 Motivation 

The present paper is motivated by needs of practical computations in finitely 
presented groups. In particular, we wish to develop tools which can be used in 
the analysis of the "practical" complexity of algorithmic problems for discrete 
infinite groups, as well as in the analysis of the behaviour of heuristic (e.g. 
genetic) algorithms for infinite groups [22, 23]. 

In most computer-based computations in finitely presented groups G = F j R 
the elements are represented as freely reduced words in the free group F, with 
procedures for comparing their images in the factor group G — F/R. Therefore 
the ambient algebraic structure in all our considerations is the free group F = 
F{X) on a finite set X = {xi, . . . ,Xm}- We identify F with the set of all 
freely reduced words in the alphabet XUX~^, with the multiplication given by 
concatenation of words with the subsequent free reduction. 

The most natural and convenient way to generate pseudorandom elements 
in G is to produce pseudorandom freely reduced words in F. The most abstract 
mathematical model of a random word generator in F is just a probabilistic 
distribution on F. We find ourselves in the setting of the paper [4], which initi- 
ated a general discussion of probabilistic measures on the free group. Analysis 
of complexity of algorithms on groups necessarily involves the study of their 
behaviour with respect to the size of the input, usually, the length of input 
words. Different probabilistic distributions on F represent pseudorandom gen- 
erators with varying mean length of words. This mean length is one of the most 
important parameters of a pseudorandom generator. Since we wish to vary the 
mean length of inputs, a single fixed distribution on F does not suffice, and 
we need a parametric family of probabilistic distributions V = {Pi} of varying 
mean length I of elements. This leads to the crucial point of our approach: a 
measure of a given subset i? C -F is not a particular number Pi{R) (which is 
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usually meaningless), but rather a function ViR) : / Pi{R) which naturally 
encodes all statistical properties of R with respect to the family of distributions 
v. It turns out that such well-known asymptotic characteristics of R as asymp- 
totic density, co-growth rate, etc., arc just the standard analytic characteristics 
of the function V. This opens the way to apply classical analytical methods 
for description of statistical behaviour of algorithms in groups. In Section 1.5 
we introduce a hierarchy of subsets R in F with respect to their size, which is 
based on linear approximations of the function This hierarchy is quite 

sensitive, for example, it allows one to differentiate between sets with the same 
asymptotic density. 

Our requirements to probabilistic distributions are motivated by a very prac- 
tical, engineering approach to computations in groups. First of all, the prob- 
abilistic distribution should not be unnatural in the context of computational 
group theory. It should provide an easy way to make crude estimates of proba- 
bilities of various subsets important in standard problems of group theory: sub- 
groups (first of all, normal or finitely generated subgroups), coscts with respect 
to subgroups, conjugacy classes, sets of words of special nature (say, squares 
or commutators). It should also provide for an easy analysis of asymptotic 
behaviour of probabilities when the mean word length tends to infinity. 

Many sets we wish to measure have happened to be context free languages 
[28]. An important subclass is made of regular subsets (that is, subsets produced 
by deterministic finite automata). This very natural class of subsets includes 
finitely generated subgroups and their cosets, and finitely generated cones (sets 
of all words which start with an initial segment belonging to a given finite set of 
words). The class of regular sets in F is closed under Boolean operations, and 
under translation and conjugation by elements of F. 

1.2 Generation of random words in F 

Let F = F{X) be a free group with basis X = {xi, . . . ,Xm}- We use, as our 
random word generator, the following no-return random walk Wg {s £ (0,1]) 
on the Cayley graph C{F,X) of F with respect to the generating set X. We 
start at the identity element 1 and either do nothing with probability s (and 
return value 1 as the output of our random word generator), or move to one 
of the 2m adjacent vertices with equal probabilities (1 — s)/2m. If we are at a 
vertex v ^ 1, we either stop at v with probability s (and return v), or move, 
with probability ^^^-i ? ^'^ "-"^^ '^^ ^hc 2m — 1 adjacent vertices lying away from 
1, thus producing a new freely reduced word vxf^. In other words, we make 
random freely reduced words w of random lengths \w\ distributed according to 
the geometric law 

p{\w\ = fc) = s(i - ,s)^ 

in such way that words of the same length k are produced with equal probabili- 
ties (in terminology of [4] , we say that our measure is moderated by the geometric 
distribution of NU { }) . Observe that the set of all words of length fc in forms 
the sphere Sk of radius k in C{F,X) of cardinality l^fel = 2m{2m — 1)*^"^. It 
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is easy to see that the resulting probabihstic atomic measure^ on F is given 
by the formula 

and 

/x.(l)=s. (2) 

Thus, iJ.s{w) is the probability that the random walk Wg stops at w. The 
mean length Lg of words in F distributed according to /ig is equal to 

oo ^ 
wEF fe=l 

Hence we have a family of probabilistic distributions fj, = {/Zg} with the stopping 
probability s G (0, 1) as a parameter, which is related to the average length Lg 
as 

1 



By we denote the function 

IJ,{R) : (0, 1) ^ M 

s Ms(-R); 

we call it measure of R with respect to the family of distributions /i. 

Denote by = nk{R) = \Rr\ Sk\ the number of elements of length k in R, 
and by fk = fk{R) the relative frequencies 

„ \RnSk\ 

Jk 



\Sk\ 

of words of length k in R. Notice that /o = 1 or depending on whether R 
contains 1 or not. Recalculating ij,{R) in terms of s, we immediately come to 
the formula 

oo 

/x(i^) = s^^(l-s)^ 

fe=0 

and the series on the right hand side is convergent for all s G (0,1). Thus, 

for every subset R C F, fi{R) is an analytic function of s. When studying the 
behaviour of /i(-R), we mostly restrict it to real arguments, but occasionally need 
to work with extensions of to larger regions of the complex plane. We use 
only most basic facts of the theory of analytic functions which can be found in 

any book on complex analysis. 

^ Recall that a measure /i on a countable set X is atomic if every subset y C X is measur- 
able. This is equivalent to saying that every singleton subset {x} is measurable. Obviously, 
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Notice that the asymptotic behaviour of the set R when ^ oo corresponds 
to the behaviour of the function /x(-R) when s ^ 0+. This will be discussed in 
more detail in Section 1.4. Here we just mention how one can obtain a first 
coarse approximation of the asymptotic behaviour of the function fJ-{R). Let 
Wo be the no-return non-stop simple random walk on C{F,X) (like Wg with 
s = 0), where the walker moves from a given vertex to any adjacent vertex 
away from the initial vertex 1 with equal probabilities l/2m. In this event, the 
probability A(w) that the walker hits an element w £ F in |w| steps (which is 
the same as the probability that the walker ever hits w) is equal to 

Xiw) = - — — — ^ I , , if w j^l, and A(l) = 1. 

This gives rise to an atomic measure 

oo 

X{R)=Y.Xiw) = Y,MR) 

weR k=0 

where A(i?) is just the sum of the relative frequencies of R. This measure is 
not probabilistic, since some sets have no finite measure (obviously, X{F) = oo), 
moreover, the measure A is finitely additive, but not cr-additive. We shall call 
A the frequency measure on F. If R is A-measurable (i.e., A(i?) < oo) then 
fk{R) — * when fc — > oo, so intuitively, the set R is "small" in F. 

A number of papers (see, for example, [1], [5], [27], [37]), used the asymptotic 
density (or more, precisely, the spherical asymptotic density) 

p{R) = limsup/fe(i?) 

as a numeric characteristic of the set R reflecting its asymptotic behavior. Un- 
fortunately, the asymptotic density is not even finitely additive, and it is not 
sensitive enough: many interesting sets have asymptotic density either 1 or 0. 

More subtle analysis of asymptotic behaviour of R in some cases provides 
the relative growth rate 

7(ii) = limsup V^CR). 

Notice the obvious inequality j{R) < 1. If 7(-R) < 1 (we will have to say more 
about this case in Section 2), then, by an elementary result from Calculus, the 
series J2 fk converges. This shows that if 7(i?) < 1 then R is A-measurable. 

Our distribution /Zj, has the uncomfortably big standard deviation a = ■ 
This reflects the fact that it is strongly skewed towards 'short' elements. How- 
ever, since real life computations take place in the vicinity of 1, we believe that 
our model is useful as a first step in developing statistical approach to compu- 
tational group theory. 
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1.3 The multiplicativity of the measure and generating 
functions 

It is convenient to renormalise our measures /Xg e /x and work with the para- 
metric family /x* = {/ij} of adjusted measures 

h:h = (-^--)-hsH. (3) 



^ 2m — 1 s ^ 

This new measure /z* is multiplicative in the sense that 

tJ,l{uov)= iJ.;{u)iJ,*{v), (4) 

where u o v denotes the product of non-empty words u and v such that = 
\u\ + \v\ i.e. there is no cancellation between u and v. The measure /j. itself is 
almost multiplicative in the sense that 

IJ.s{u ov)= ciJ.s{u)iis{v) for c = -^^-^ ■ (5) 

for all non-empty words u and v such that \uv\ = \u\ + \v\. 
If we denote 

then 

M:H=il"l (7) 

for every non-empty word w. 

Similarly, we can adjust the frequency measure A making it into a multi- 
plicative atomic measure 

Let now ii be a subset in F and rifc = nk{R) — \Rr\ Sk\ be the number 
of elements of length k in R. The sequence {nfc(i?)}^Q is called the spherical 
growth sequence of R. We assume, for the sake of minor technical convenience, 
that R does not contain the identity element 1, so that no = 0. It is easy to see 
now that 

oo 

li*{R)=J2nkt''. 

One can view /i* (R) as the generating function of the spherical growth sequence 
of the set R in variable t which is convergent for each t G [0, 1). This sim- 
ple observation will allow us (see Sections 3 and 4) to apply a well established 
machinery of generating functions of context-free languages to estimate proba- 
bilities of sets. 
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1.4 Ceseiro density 

Let /i = {/ijj} be the parametric family of distributions defined above. For a 
subset i? of -F we define the limit measure /Uo(i?) : 

oo 

lio{R)= lim fi{R)= lim ^.^/^(l-s)^ 

The function /io is additive, but not (T-additive, since /xo(w) = for a single 
element w. It is easy to construct a set R such that lims^Q+ /x(-R) does not 
exist. However, in the applications that we have in mind we have not yet 
encountered such a situation. Strictly speaking, fiQ is not a measure because 
the set of all /Uo-measurable sets is not closed under intersections (though it is 
closed under complements). Because Hs{R) gives an approximation of Ho{R) 
when s ^ 0+, or equivalently, when Ls oo, we shall call R measurable at 
infinity if /xo(i?) exists, otherwise R is called singular. 

If n{R) can be expanded as a convergent power series in s at s = (and 
hence in some neighborhood of s = 0): 

IJ,{R) = mo + mis + m2S^ H , 

then 

/io(-R) = lim l^{R) = iTT'O- 

s— >0+ 

A corollary from a theorem by Hardy and Littlewood [13, Theorem 94] (see 
Corollary 5.2 in Section 5) asserts that no can be computed as the Cesaro limit 

lxoiR)= lim -(/i + ••• + /„). (9) 

n— ^oo n 

So it will be also natural to call /xq the Cesaro density, or asymptotic average 

density. 

Notice, that the Cesaro density no is more sensitive then the standard asymp- 
totic density p. For example, if ii is a coset of a subgroup H of finite index in 
F then it follows from Woess [37] that 

1 

Mo(-R) - . jjy 

while, obviously, p{H) = 1 for the group H of index 2 consisting of all elements 
of even length. 

On the other hand, if lim^^oo fk{R) exists (hence is equal to p{R)) then 
Ho{R) also exists and po{R) = p{R)- In particular, if a set R is A-measurable, 
then it is /xo-measurable, and po{R) = 0. 

1.5 Asymptotic classification of subsets 

In this section we introduce a classification of subsets i? in F according to the 
asymptotic behaviour of the functions n{R). 
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Let /i = {/is} be the family of measures defined in Section 1.2. We start 
with a global characterization of subsets of F. 

Let iZ be a subset of F. By its construction, the function /Lt(i?) is analytic 
on (0,1). The subset R is called rational, algebraic, etc, with respect to /i if 
the function is rational, algebraic, etc. We say that R is smooth if /i(i?) 

can be analytically extended to a neighborhood of and is regular at 0. 

Algebraic sets and context free languages. If the set R is an (unam- 
biguous) context free language then, by a classical theorem of Chomsky and 
Schutzcnberger [6], the generating function n*{R) = 'Y^rikt^, and hence the 
function /u(i?), are algebraic functions of s. Moreover, if R is regular then /u(i?) 
is a rational function with rational coefiicients [9, 34]. 

An important class of example of algebraic subsets is provided by a theorem 
of MuUer and Schupp [21]: A normal subgroup R<iF is & context free language 
if and only if the factor group F/R is free-by-finite. Notice that, for the derived 
subgroup R = [F. F] of the free group of rank 2, the measure /i(i?) is not an 
algebraic function. Richard Sharp kindly informed us that this follows from a 
remark on p. 127 of his paper [29]. See also Example 2. 

It is well known that singular points of an algebraic function are either poles 
or branching points. Since ^J,{R) is bounded for s € (0, 1), this means that, for a 
context-free set R, the function /i(i?) has no singularity at or has a branching 
point at 0. After uniformisation, we can expand /x(-R) as a fractional power 
series: 

Ijl{R) = mo + mis^/" -|- m2S^/" + ■■■. 
If R is regular, than we actually have the usual power series expansion: 

li{R) = mo + niis + TO2S^ + • • • ; 

in particular, /i(i?) can be analytically extended in the vicinity of and R is 
smooth. 

Linear approximation. If the set R is smooth then the linear term in the 
expansion of n{R) gives a linear approximation of n{R): 

li{R) = mo + rnis + 0{s^). 

Notice that, in this case, toq = Ho{R) is the Cesaro density of R. It can be 
shown (see Corollary 5.3 in Section 5) that if Ho{R) = then 

oo 

fe=i 

On the other hand, even without assumption that R is smooth, if R is A- 
measurable (that is, the series J2fk{R) converges), then, by Corollary 5.3, 

lio{R) = and lim ^ = X{R). 
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This give us a good excuse to use for the limit 

/xi = lim , 

if it exists, the same term frequency measure as for A. The function jji is an 
additive measure on F (though it is not cr-additive). 



Asymptotic classification of sets. Now we introduce a subtler classification 
of sets in F (which is based on the linear approximation of /x(i?): 

• Thick subsets: /xo(-R) exists, /xo(i?) > and 

/x(i?) = /Lto(i?) + ao(s), where lim ao{s)=0. 

s— »0+ 

• Sparse subsets: ij-o{R) = 0, Hi{R) exists and 

IJ,{R) = ^i{R)s + a-i_{s) where lim = 0. 

s^0+ s 

• Intermediate density subsets: /Uo(-R) = but /ii(i?) does not exist. 

• Singular sets: Ho{R) does not exist. 

Wc put on record the following simple observation which follows from dis- 
cussions in Section 1.2. 

Lemma 1.1 Every X-measurable set is sparse. In particular, if j{R) < 1 then 
R is sparse. 

We shall see in Section 3 that, for the important class of regular sets, the 
generating function is a rational function and hence every regular set is either 
thick or sparse. 



1.6 Degrees of polynomial growth 

In this section we introduce degrees of polynomial growth "on average" for 
functions on the free group with respect to the family of distributions fi. In 
particular, it would produce hierarchies of the average case complexity of various 
algorithms for infinite groups, which would make meaningful statements like 
"the algorithm works in cubic time on average" . A different approach to degrees 
of growth "on average" was suggested in [4] . 

Let /u = {ns} be the family of measures constructed in Section 1.2 and A be 
the frequency measure on F. Let / : F„ — > M be a non-negative real valued 
function. 

The average value E). = Ek{f) of the function / on the sphere Sk (with 
respect to A) is equal to: 

e,= y: fiw)x{w) = E #T- 

weSk \w\=k ' 
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For every fixed stopping probability s S (0, 1) we evaluate the mean value 
Mf{s) of the function / with respect to fig as 

Mf{s) = 

oo 
k=0 

If for every s & (0, 1) the value Mf{s) is finite then the function Mf{s) is called 
the mean value of / with respect to the family of distributions n. The growth 
of the function Mj at s corresponds to the growth of the mean values of / 
with respect to the family fj, when the mean length L = ^ — 1 tends to infinity. 
Therefore, if we rewrite Mf{s) in the variable L: 

M^L)=Mf(^j^y Xe(0,oo), 

then the growth of Mjf at oo reflects the growth of the initial function / when 
the length of words tends to oo. This allows one to introduce the notion of the 
polynomial growth of / on average. 

Let v : (l,oo) — > M be an arbitrary continuous probability density on the 
interval (l,oo) and v{x)dx the corresponding probabilistic measure. We say 
that a non-negative real valued function f : F -^M. has a polynomial growth of 
degree d on average with respect to fj, and v if the function Mjf has polynomial 
growth of degree d on average with respect to v, i.e., the following improper 
integral converges at oo: 

I — — v{x)dx. 
Jo x'l ^ ' 

and d G N is the minimal with this property. 

If ri{s)ds is the measure on (0, 1) obtained from i'[x)dx by the change of 
variables s = l/(a; + 1), then this is the same as to say that 

/ s'^Mf{s)r]{s)ds 
Jo 

converges at 0. In most cases we can use the standard measure ds on (0, 1). 

Elementary results from analysis give the following simple and useful test 
for polynomial growth of functions on average. 

Lemma 1.2 Let f : F ^M. be a non-negative real valued function on F. If the 
mean value function Mf{s) is defined for s € (0, 1) and, in the vicinity of 0, 

Mf{s) = 0{s-'') 

for some positive integer d then f has polynomial growth of degree at most d on 
average for any continuous probabilistic measure ri{s)ds on (0, 1). 
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Our definition of polynomial growth on average is justified by the following 
simple observation. 

Lemma 1.3 The function /{w) = has growth of degree n on average. 
Proof. In view of Lemma 1.2, it will suffice to prove that, in the vicinity of 

0, 

Mf{s) = 0{s-). 
We shall work with a larger function 

g{w) = {\w\ + l){\w\+i)---{\w\+n). 

Without loss of generality, we can assume ^(l) = 0. It is easy to see that its 
mean 

CO 

Mg(s) = s^(A; + l)(fc + 2) • • • (fc + n)(l - sf. 
fe=i 

After changing the variable, ^ = 1 — s, it is enough to prove that the function 

•oo 

M{z) = {l-z)- ^(fc + 1)(A; + 2) • • • (A; + n)z^ 
fe=i 

has a pole of degree at most d aA, z =1. But it is very easy to see that 

has a pole of degree n z =1. □ 

The following lemma shows that our definition of growth is natural in the 

sense that polynomial growth of averages Ek{,f) of the function / over the 
spheres Sk implies polynomial growth of the function / on average in the sense 
of our definition. 

Lemma 1.4 // Fk{f) ^ Ck'^ for some constant C, then f{w) has polynomial 
growth of degree at most d. 

Proof. Immediately follows from the previous lemma. □ 
1.7 Negligible sets 

Let R C F and x be the characteristic function of R. Wc say that a set R is 
polynomially negligible if for every positive integer d, the polynomial function 
\w\'''x{w) restricted to R has growth of degree at most on average. 

The purpose of this concept is that, in computations of degrees of growth 
on average, we can use a 'cut and paste' technique and ignore any polynomial 
function of any degree with support restricted to R. 
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Theorem 1.5 Let B. C F and fk = \Rr] Sk\/\Sk\ be the relative frequency of 
elements of R in the sphere Sk- Assume that the function 

fe=0 

can be continued analytically to a neighborhood of and is regular at 0. Then 
the set R is polynomially negligible. 



Proof. We can assume without loss of generahty that R C F \ {1} hence 
/o = 0. FoUowing the same hne of argument as in Lemma 1.3, we replace \w\" 
with the larger function 

g{w) = {\w\ + l)(|w| + 2) • • • {\w\ + n) 

and set g{l) = 0. Let z = 1 — s and r{z) = YlifkZ^- Observe that r{z) is 
analytic and regular in a neighbourhood of z = 1. It is enough to prove that 

oo 

G{z) = {l-z)- J2{k + + 2) • • • (fc + n)fkz'' 

k=0 

is regular at z = 1. But this is obvious because 

G{z) = il-z)—{z"+'-r{z)) 

is regular at z = 1. □ 
Corollary 1.6 // the relative growth rate j{R) < 1 then R is negligible. 

Proof. Since the radius of convergence r of the series J2 fk{R){^ — s)'' is 
computed as r = l/-f{R), wc sec that the function ^ fk{R){l — s)*^ is analytic 
and regular in the vicinity of 0. Hence R is negligible by Theorem 1.5. □ 



2 Normal subgroups and cogrowth 

2.1 Non-recurrent and non-amenable factor groups 

Let G be a finitely generated group with an atomic probability measure v : G 
[0, 1]. The measure v is called symmetric if i^{g) = i^ig^^) for all g & G. The 
support of u is defined as 

supp(z/) = {geG\ v{g) + 0}. 

With a given measure v on G one can associate a random walk Wy on G such 
that the transition probability from 5 to /i is equal to i'{g~^h). A finitely 
generated group G = F/R is called recurrent, if it admits a symmetric atomic 



A. V. Borovik, A. G. Myasnikov and V. N. Remeslennikov • Multiplicative measures • 11.04.02 1 2 



probability measure v : G ^ [0,1], whose support generates G, and such that 
the corresponding random walk W^, on G is recurrent. Recall, that a random 
walk is recurrent if it returns to 1 infinitely many times with probability 1, i.e., 
the series 

n=0 

where p^"^(l) is the probability for the walker to return to 1 in n steps, is 
divergcnit. By a result of Varopoulos [36] based on Gromov's polynomial growth 
theorem [12], a group is recurrent if and only if it is finite or a finite extension 
of Z or Z^. Grigorchuk gave in [11] another useful characterization of recurrent 
groups: the group G = F/R is recurrent if and only if the series 

>^ nkjR) 

diverges. Observe, that the latter is equivalent to the condition that X^^q fk{R) 

is divergent, i.e., A(i?) = oo. 

Theorem 2.1 Let R be a normal subgroup in a free group F. If the factor 
group F/R is not recurrent then R is sparse. 

Proof. Let F be a free group of rank m, i? be a normal subgroup of F such 
that F/R is not recurrent. Let nu = \Rr\ Sk\, and fk = nk/\Sk\- Since F/R 
is not recurrent, it is infinite and, by a result of Woess [37], the asymptotic 
density p{R) = lim^^oo fk exists and equal to 0. By the criterion above the 
series X^fc^o fk converges. Therefore it is Abel summable, i.e., there exists a 
limit 

DC CO 

^hm EA(l-«f = E/fc 

k=0 k=0 

and 

oo 

^i{R) = s-E^(i-s)' 

= iJ.is + o{s), 

where 

oo 

Ml = E f''- 

k=0 

□ 

A classical criterion of amenability, due to Cohen [7] and Grigorchuk [11] 
claims that a finitely generated group F/R is amenable if and only if the cogrowth 
coefficient limsup(nfc(i?))^/'^ = 2m — 1. This immediately gives the following 
result. 

Theorem 2.2 Let R be a normal subgroup in F. If the factor group F/R is 
not amenable then R is sparse and polynomially negligible. 
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Proof. Wc have mentioned in Section 1.2 that if j{R) < 1 then R is A- 
measurable, i.e., the series J2k^=o fk{R) converges. In the same time, 

i/fe 



2m -1 ^V(2to-1)^ 

nk{R) 



Hm sup 



^2m(2m- 1)'=-! 
= Hmsup(/fe)^/'= 
= 7(ii) 

Hence if hmsup(nfe(i?))"'^/'^ < 2m — 1 then 7(-R) < 1 and R is sparse by 
Lemma 1.1. By Corollary 1.6, R is polynomially negligible and the theorem 
follows. □ 

It is worth mentioning a corollary from the proof: since the convergence 
radius of the generating function 



k=0 

-1 



is ^limsup (njt(i?,))"'^''*^ j , we have: 
Corollary 2.3 The convergence radius of N{R){t) is ((2m — 1)7(-R))~^. 

2.2 Return generating function 

Let M = M{X) be the free monoid generated by X U and 

ri : M{X) F{X) 

the canonical cpimorphism of monoids which is induced by the identity map on 
X U Analogously to free groups, denote by Sk{M) the set (sphere) of all 

words in M of length k. Then, given a normal subgroup R<F, we can consider 
two generating functions, 

N{t) =^nkt^ and N*{t) =^nlt'', 

where Uk = nk{R) = |i?n S'fc | and n* = n*(i?) = |r/-i(i?) n S'fc(M)| (the number 
of words of length fc in M which are mapped into Rhy rj). The function N*(t) 
is called the return generating function. 

The following formula links the functions N{t) and N*{t) for a normal sub- 
group R<iF: 

Njt) ^ ^* (i+(2r>^-i)«0 

1-t^ 1 + (2m - 1)42 • ^ 

(In [2] Bartholdi proved a more general result (see Section 2.3), attributing 
Equation (10) to Godsil [10, p. 72].) 
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Example 1. Wc shall use Equation (10) for the computation of the 
measure of the co-diagonal subgroup D of F, that is, the kernel of the 
homomorphism F — » Z defined by mapping all generators Xi € X to the 
generator 1 of Z. 

It is easy to see that there are ( ^/^ ) rri^'' words of length 2k in M which 



are mapped by rj into D. Indeed, these words are 2fe-tuples of elements 
Xi (and there are m^'' of them) with the exponents ±1 assigned to them 

such that the sum of exponents is 0; there are ( ^ ) assignments of 



exponents. Since 



fe=0 



(see sequence A000984 of [30]), 



and 

N{t) 



(1 + (2m - l)t2) ^/T^ 



W«2 



(l+(2m-l)t2)2 



(2m- 1)2*2 



A close look at the zeroes of the denominator in the expression for N{t) 
tells us that the convergence radius of A''(*) is l/(2m— 1), and Corollary 2.3 
reinterprets this statement as 7(-D) = 1. A direct computation shows that 



An analysis along the lines of the proof of Theorem 1.5 shows that D is 
not polynomially negligible. 

Notice that 

nt 

Pk 



{2mY 

is the probabiUty for a simple random walk^ on T to return to the initial vertex 
H after k steps. A considerable body of Uterature on random walks on groups 
contains various information about the return probability generating function 
P{t) = 'Yl/k=aPt''^'^ (which is a special instance of the Green function of the 
random walk). Since N*(t) — P{2mt) and the frequency generating function 

oo oo 
k=0 fc=l ^ ^ 



2This means that we move from a vertex to any of 2m adjacent vertices with equal prob- 
abilities l/2m. 
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is related to N{t) as 



2m - 1 / / t 
P{t) = 1 + — N \ 



2m \ \ 2m — 1 

we easily convert (10) into the following formula: 

{2m-lf-t' ( 2mt \ 1 
2m (2m-l) + i2 \{2m- 1) J 2m' ^ ' 

Example 2. For a normal subgroup N < F, Equation 11 reduces the 
question of algebraicity of the function ij,{N) to that one for the generating 
function N*{t) = ^nlt'' for the number of non-reduced words in N. 
Assume that F has rank 2 and take for A'^ the derived subgroup N = 
[F, F] of F. Then non-reduced words of length k from the free monoid 
M correspond to simple random walks of length k on the factor group 
F/N ~ Z X Z which start and end at 0, the probability of that event 
being nl/{2rn)''. We found ourselves in the classical realm of random 
walks on lattices. A paper by Montrol [25, p. 201] (sec also [24]) contains 
a closed formula for the return probabilities generating function for a 
simple random walk on Z x Z: 

2 „ 



P{t) = — Q-1/2 



-KZ ' \ 

where Q-1/2 (2) is a Legendre function of the second kind. As shown in 
[25, Equation 22 on p. 201], 

Since fJ.s{N) = sF{l — s), after an easy calculation with Equation 11, we 
see that the function //(AT) is not algebraic and 

,,,, sloes 
HsiN) ~ — as s -» 0. 

TV 

In particular, Hs{N)/s has logarithmic divergence at s = 0, pi does not 
exists and N = [F, F] is a subgroup of intermediate density. 



2.3 Non-normal subgroups and random walks on regular 
graphs 

In this section, we transfer Theorem 2.2 from normal to arbitrary subgroups of 
F. 

If iJ is a (not necessarily normal) subgroup of F = F{X), the set F/H of 
right coscts gives rise to the Schreier graph of H , denoted by T, if we connect 
the cosets Hy and Hyx, x E X,hy a directed edge marked x. Every closed path 
in r from H to H gives a word in the free monoid M{X^^) which represents an 
element from H, if we read tlie edge label when we go along the edge, and its 
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inverse, if we go against the direction of the edge. Reduced words correspond 
to paths without backtracking of edges. 

Notice that F is is a 2m-regular graph, that is, every its vertex has valency 
2m. 

Denote by rife the number of closed paths without backtracking of edges 
which start and end at the vertex H. Notice that rife = nk{H) is exactly the 
number of reduced words of length k in H . Also, denote by bk = bk{H) the 
number of all paths of length k from H to H, and let 

OO OO 

N{t) = J2 nkt^ and B{t) = ^ bkt'' 



be the corresponding generating functions. 

Formula (10) is a special case of the following result valid for all regular 
graphs [2]: 



N{t) ^ (l + (2m-l)t2 



(12) 



1-t^ 1 + (2m - l)i2 • 
Denote by 

_ bk 
" (2m)fc 

the probability for a simple random walk on F to return to the initial vertex H 
after k steps. The quantity 

1/ = lim sup ^/pk 
is called the spectral radius of F. Obviously, v ^1. 

Theorem 2.4 // the coset graph F of a subgroup H < F has spectral radius 
V <1 then H is sparse and polynomially negligible. 



Proof. Let r' and ;•" be the convergence radii of the formal power series N{t) 
and B{t), then, by the well-known result from calculus, 

r' = (lim sup ^/uk)^^ and r" = (lim sup \/bk)~^ ■ 

If < 1 then, obviously, r" > l/(2m). The formula (12) relates the convergence 
radii of N(t) and B{t) (see also [26] where this relation was developed earlier). 
It is easy to see that r' > l/(2m — 1), hence for the relative growth rate of H 
we have: j{H) < 2m — 1, and by Corollary 1.6 is sparse and polynomially 
negligible. □ 

A similar technique with the use of results from [18, Chapter 7] proves the 
following theorem. 

Theorem 2.5 (T. Smirnova-Nagnibeda, private communication) // 

H < F is a subgroup of infinite index then its asymptotic density 

p{H) = 0. 

In particular, it follows that iJo{H) = 0. 
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2.4 Preimages of quasiconvex subgroups 

of hyperbolic factor groups are negligible 

Recall that a finitely generated group G is word- hyperbolic if for any (some) 
finite generating set S of G there is ^ ^ such that all geodesic triangles in in 
the Cayley graph C(G, S) of G with respect to S are 5-thin, that is. each side 
is contained in the closed ^-neighbourhood of the union of the other two sides. 
A subgroup of a word-hyperbolic group G is quasiconvex if for any (some) 
generating set 5 of G there is e ^ such that every geodesic in C{G, S) with 
both endpoints in H is contained in the e- neighbour hood of H. 

In [15, Theorem 1.2] I. Kapovich proved that the coset graph G/H of a 
quasiconvex subgroup 7J of a hyperbolic group G has spectral radius < 1. Now, 
as an application of Theorem 2.4 we have the following result. 

Theorem 2.6 Let R < F be a normal subgroup of a free group F such that 
F/R is a non- elementary word-hyperbolic group, and R ^ H < F a subgroup of 
infinite index in F such that H/R is a quasiconvex subgroup of F/R. Then H 
is sparse and polynomially negligible in F. 

3 Measure of a regular set 

3.1 Reguleir Languages and finite automata 

In this section we show how to compute the measure fi{R) of a regular subset of 
the free group F = F{X) of rank m. Most of the results here are just a proper 
interpretation of some well-known facts about regular sets. We refer to [8] for 
detailed discussion of regular subsets of F. 

Recall that a finite automaton ^ is a finite labelled oriented graph (possibly 
with multiple edges and loops). We refer to its vertices as states. Some of the 
states arc called initial states, some accept statc;s. Wc assume further that every 
edge of the graph is labelled by one of the symbols x^^,x £ X. A path in A 
is a set of edges eo, ■ ■ ■ ,ei such that, for each i = 1,. . . ,1, the endpoint of Cj-i 
is equal to the starting point of e^. Reading the labels on edges along the path 
in the natural order, we get the label of the path. The language accepted by 
an automaton A is the set C = C{A) of labels on paths from an initial state 
to an accept state. An automaton is said to be deterministic if, for any state, 
there is at most one arrow with the given label exiting from the state. A regular 
set is a language accepted by a finite deterministic automaton. For every finite 
deterministic automaton A one can construct a finite deterministic automaton 
A* such that £.{A) = £{A*) and where the sets of initial and accept states are 
disjoint. It would be convenient for us to work only with non-empty words, that 
is, elements in F \ {1}. 

We assemble here some (mostly well known) facts about regular sets. 

Theorem 3.1 Let A and B are regular subsets in F. 
• The sets Au B, Ad B and A \ B are regular. 
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• The prefix closure A of a regular set A is regular. Here, the prefix closure 

A is the set of all initial segments of all words in A. 

• If C is a regular set in the free monoid M freely generated by X U 
then its image C under the natural reduction homomorphism M — > F is 
regular. 

• The product 

AB = {ab\aeA,beB} 

and the set of inverses 

A-^ = {a-^ \aeA} 

are regular. 

• Every finite subset in F is regular. 

• If (j) : F — > F is an endomorphism then the set (}>{A) is regular. 

Now we will show how to compute the measure /Lt(£) of a regular language 
accepted by a finite deterministic automaton A. Recall, that the measure fj, = 
{Hs} gives rise to a multiplicative measure fj,* = {/U*} 

such that 

^*(w)=fl^l, where t= 

'^^^ ^ 2m -1 

By numbering the states by numbers l,...,n, we can associate with the 
automaton A its adjacency matrix A by taking an n x n matrix and writing the 
number of arrows from from state i to state j in the position {i,j). It is easy 
to see that the number of different paths of length I from state i to state j is 
{A^)ij and the measure of the set of labels on these paths is . Let / and 

J be the sets of initial and accept states. If we denote T = tA then it follows 
that 

/.:(£)= i{Th + iT%, +...). 

iei,jeJ 

In particular, the series on the right converges for every given s. Denote by 
B = T + T'^ + ■ ■ ■ the matrix with entries from the ring of formal power series 
then, obviously, 

iei,jeJ 

and 

B = {i^- r)-i - /„. 
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We come to the following formula: 

E {{In-T)-'-Inh. 

iei,jeJ 

If we replace A by the automaton A* which accepts the same language and 
where an initial state is never an accept state, we can simpliiy the formula and 
write 

M*(/:)= E ((/„-T)-%.. 

ieijeJ 

We have as a corollary the following result. 

Theorem 3.2 The measure /i*(-R) {and hence the probability measure H^R)) of 
a regular subset of F is a rational function in t {and hence in s) with rational 

coefficients. 

We can now apply this theorem to the Cesaro density (see Section 1.4) and 
asymptotic classification of regular sets (see Section 1.5). 

Corollary 3.3 The Cesaro density of a regular set is a rational number. 

Corollary 3.4 Every regular set is either thick or sparse. 

3.2 Thick regular sets 

We describe below thick regular sets. 

A cone C = C{w) with the vertex w is a set of all elements in F containing 
the given word w as initial segment. Obviously, cones are regular sets. 

Let -B = {u I |u| ^ k—1} be the ball of radius fc— 1. Then F\B is the union 
of |S';s| cones each of which has the same measure as the given cone C = C{w) 
with \w\ = k. Hence 

KC{w))=^-^J^ = -^+0{s). 

\^\w\\ \^\w\\ 

In particular, a cone is a thick regular set. The following theorem shows that 

every thick regular set involves a cone. 

Theorem 3.5 Let R be a regular subset of F. Then R is thick if and only if 
its prefix closure R contains a cone. 

Proof. Notice that if a regular set R is accepted by a finite deterministic 
automaton A, then its prefix closure R is accepted by the automaton A obtained 
from A by extending the set of accept states by adding all states which belong 
to a directed path in A from an initial state of A to an accept state of A. 

Since cones are thick sets, one direction of our theorem immediately follows 
from the following lemma. 

Lemma 3.6 Let R C F be a regular set. Then R is thick if and only if its 
prefix closure R is thick. 
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Proof. Of course, if R is thick then R is thick. To prove the reverse, we use 
the obvious observation that the union of finitely many of sparse sets is sparse 
(if /Lto-measurable). Let A he a. finite deterministic automaton which accepts R 
and ui, . . . ,f„ the accept states of A. Denote by Ri the subset of R accepted 
by the state Vi. Then R = RiU- ■ - URn and one of the regular sets Ri is thick. 
If w is a label on a directed path from Vi to an accept state, say Vj , of A then 

Ri ow = {x o w \ X ^ Ri} 

is obviously a thick set and belongs to R. □ 

Now we can assume that the set R is thick. Since the union of finitely many 
sparse sets is sparse, we can assume without loss of generality that a finite 
deterministic automaton A for R has only one initial state / and one accept 
state Z. We have to remember that our automaton accepts only reduced words. 
Therefore A can be rewritten in the form where 

(a) For any state A of A, all arrows which enter A have the same label a G 
X U and arrows exiting from A cannot have label (this can be 
achieved by splitting the states of A in the way shown on Figure 1.) We 
shall say in this situation that A has type a. 

(b) For every state A of A, there is a directed path from A to the accept state 
Z. 

(c) In addition, it is easy to arrange that there are no arrows entering the 
initial state I. 




Figure 1 : Splitting the states of the automaton A. 

This means, in particular, that there are at most 2m arrows exiting from the 
initial state /, and at most 2m — 1 arrows exiting from any other state. We can 
assign frequencies l/2m to arrows exiting from I and frequencies l/(2m— 1) to 



A. V. Borovik, A. G. Myasnikov and V. N. Remeslennikov • Multiplicative measures • 11.04.02 21 



arrows exiting from other states. Now, for a word w accepted by A, its relative 
frequency 

X(w) = -i — i — - 

^ ' 2m.(2m- l)!™!-! 

is the product of frequencies of arrows in a directed path from the initial state 
/ to the accept state Z which correspond to the word w. We aim at proving 
the following statement from which our theorem immediately follows by virtue 
of Lemma 1.1: 

// R contains no cone then it is X-measurable, that is, 

X{R) = J2 ^(«') 
weR 

is finite. 

For that purpose form the automaton obtained from A by removing 

all arrows exiting from Z; we take / and Z for its initial and accept states, 
correspondingly. Consider also the automaton A2 formed by all states accessible 
from the state Z, with the same arrows between them as in A; we take Z for 
the both initial and accept states. 

We assign to arrows in Ai and A2 the same frequencies as to arrows in A. 
Since / does not belong to A2, all arrows in A2 have frequencies l/(2m — 1). 
If now i?i and R2 are languages accepted by Ai and A2 then, obviously, R = 
i?i o i?2. Moreover, if u G Ri and w G i?2 then the word uv is reduced and 

X{uv) = X{u)X*{v). 

Since the presentation of R in the form R = Rio R2 is unambiguous, it follows 
that 

A(i?) = A(i?i)A*(i?2). 

Transform the automaton A2 further by splitting the state Z into separate initial 
state Zi (with no arrows entering it, and those arrows which exited from Z now 
exiting from Zi), and the accept state Z2 (with no arrows exiting from it, and 
those arrows which entered Z now entering Zi). If R3 is the language accepted 
by the new automaton ^3, then, obviously, 

i?2 = -R3 U {R3 o Rs) U (ii3 o i?3 o iia) U • • • 

and 

X*{R2) < X*{R3) + X*{R3f + X*{R3f + ■■■. 

Assume that R contains no cone. Then the both subsets .Ri and R2 contains 
no cone. 

Let us look first at A2- Assume that, for every state A of A2 of type 
a € X U X~^, every possible label from X U X~^ \ {a} is present on one of 
the arrows exiting from A. Then it is easy to see that R2 contains a cone. 
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Therefore wc can assume that, for some state A, there are less than 2m — 1 
arrows exiting from A. If we now look at the automaton ^3, it becomes obvious 
that A*(-R3) < 1. To see this formally, we can consider a Markov chain A4 whose 
states arc the states of As together with a additional dead state D (see [17] for 
background material on Markov chains). We set the transition probabilities 
from Z2 to Z2 and from D to D being equal 1. Every arrow in A3 corresponds 
to a transition in A4 with the transition probability l/(2m— 1). If at some state 
A of A2 there is no arrow labelled b G X U X~-^ exiting from A, we make in 
A4 a transition from A to D with the transition probability l/(2m — 1). The 
probability distribution on A4 concentrated at the initial state Zi , converges to 
the steady state P which is zero everywhere with the exception of the two dead 
states Z2 and D. Since P{D) ^ 1, P{Z2) < 1. But, obviously, P{Z2) = A*(ii3). 

Now the summation of the geometric progression for A*(i?2) shows that 
A*(i?2) < 00. 

An analogous argument for Ai shows that A(J?i) < 00. Therefore X{R) < 00. 
□ □ 

3.3 Measures of finitely generated subgroups 

Let F = F{X) be the free group with basis X. It is well known that finitely 
generated subgroups in F arc regular sets; the most suitable for our purpose 
exposition of this and similar results can be foimd in [16]. 

Let /i* be the adjusted multiplicative measure on F. Let H he a subgroup 
of F generated by elements hi, . . . ,hk- We shall slightly modify the arguments 
from the previous section to produce a somewhat more practical procedure for 
computing the measure fx*{H). In particular, ^i*{H) will be expressed as a 
rational function of measures ii*(wi) of certain words wi, . . . ,Ws which do not 
depend on choice of generators h\, . . . ,hm'^^ H although can be easily computed 
from them. 

Let F be the core subgroup graph of H in sense of [16]. Notice that F does 
not depend on a particular choice of generators of H. We mark on F the initial 
vertex 1 and those vertices which have degree at least 3. This new vertex set 
V* can be turned into a digraph F* with edges labelled by freely reduced words 
from F. To do so, we define edges of F* to be reduced paths in F which start 
and end at vertices in V* and do not pass through any other vertex from V* . 
The label of the path becomes the label of the corresponding edge in F*. We 
call F* the consolidated subgroup graph of H. 

Now it is easy to see that, since F is folded, a reduced path in F*, viewed 
as a path in F, is also reduced. Every element h G H is the label of a reduced 
path in F from 1 to 1, as well as, the label Wi o . . . o wi of the corresponding 
reduced path in F* from 1 to 1. It follows that = iJ,*{wi) ■ ■ ■ fi*{wi). 

Our description of the matrix method of computing the measure of a finitely 
generated subgroup will be illustrated by the following example, which we do 
in parallel with the formal discussion. 



Example 3. Let C be a subgroup generated by a single element c. Ob- 
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viously c can be presented in the form c — uvu~^ without cancellations 
between the words u, v and . The consolidated subgroup graph r*(C) 
of C has the form 




We start with the consoUdated subgroup graph T* of H. If e is an edge in 

r*, wc denote its label by A(e). As usually, we use the convention that for every 
edge e we also have an edge with the opposite direction and the inverse label 
X{e)-\ 

so, in our example, V*{C) is a digraph with 4 directed edges. 

The process of writing non-trivial random words from H can be described 
by the automaton T which consists of one state for each directed edge of the 
digraph F* plus one initial state. Every directed edge e of F* is interpreted as 
the state "we wrote the word A(e) of e" , the initial state is "we wrote the empty 
word". If the origin of edge / is the terminus of edge e, we say that there is a 
transition from the state "we wrote the word A(e) to the state "we wrote the 
word A(/)", and we assign to this transition measure ^*(A(/)). To a directed 
edge e which exits from the initial vertex 1 of F* , we assign the transition from 
the state "we wrote the empty word" to the state "we wrote the word A(e)" with 
measure /^*(A(e)). Finally, the accept states of our automaton T correspond to 
directed edges of F* whose terminuses are the initial vertex 1. 

We label the states of T by consecutive numbers 1, 2, 3, . . ., so that the initial 
state "we wrote the empty word" has label 1. Now the transition measures of 
automaton T form a matrix which we denote A. 

In our Example 2, the states are: 

1 We wrote an empty word (the initial state) ; 

2 we wrote u; 

3 we wrote v; 

4 we wrote v~^; 

5 we wrote (the accept state), 
and the transition matrix is 








p 








M 










<i 


q 













g 





p 













9 


p 


V 














/ 



where p = n*{u) and q = fi*{v). 
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Let j'l, . . . , jk be the accept states of T. The multipUcativity of the adjusted 
measure n* allows to use the same matrix technique as in computations on 
Markov chains, and the measure of the set of words from H which can be 
obtained by I moves becomes the sum 

(^^,,+- + (^').,. 

of the matrix elements of the matrix which correspond to transition from the 
initial to an accept state. Notice that we produce only non-trivial elements of 
H. Hence the set H \ {1} of non-trivial elements in H has the measure 

k 

„*(H^{l})=Y,{A + A^ + ... + A- + ..\.^. 

i=l 

Denote B = A + A^ -\ , then, obviously, 

B= [E - A)-^ - E, 

where E is the identity matrix. Since, by our construction, the initial state is 
never an accept state, the matrix elements (-B)iji , • . . , {B)ij^ do not lie on the 
diagonal and therefore {B)ij. = {{E — A)~^)ij^ for alH = 1, . . . , A;. Hence 

k 

M*(ifx{l}) = ^((i5-^)-i)y^. (13) 

Since the elements of the inverse matrix [E — A)~^ are rational functions of 
matrix elements of the matrix A, we proved the following theorem. 

Theorem 3.7 // n is the multiplicative measure on F, the measure ^Ji,{H) of 
a finitely generated subgroup H of F is a rational function of the measures of 

labels on the consolidated subgroup graph of H . 
In particular, ^(H) is a rational function of s. 

In Example 3, a direct computation with Mathematica shows that 

PI pq "ip^Q \ 

l-<3 1-9 1-9 
9 9 ipQ 

1-9 1-9 1-9 

— ^- 

1-9 ^ 1-9 

1-9 1-9 

1 / 

and 

M*(Cx{l}) = 1^ 



(E-A)-' 



( 1 p 

1 





V 
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Hence 



MC7x{l}) 



2m -1 2( 



/ _l--s_ \ 2 1 li I + 1 ^; 1 
2m-l ' 



2m 1 



1 - (^ri^ll"! 
y2m.-l' 



2 



• s + o{s). 



2m(2m- • (1 - ( 



1 



2m,-l 



3.4 Normal subgroups of finite index 

Kouksov [19] proved that a normal subgroup N < F has a rational cogrowth 
function f{t) = ^ntt'^ if and only if the index \F : N\ is finite. In that case 
N is finitely generated, and its subgroup graph F is the Cayley graph of G. In 
notation of (10), the generating function N*{t) for the number of non-reduced 
words in N has a very beautiful form found by Quenell [35] : 



where Aj are the eigenvalues of the adjacency matrix of F. 

4 Context free languages 

Combinatorial analysis of context free languages is a well established area of 
combinatorics with powerful tools for manipulating generating functions of lan- 
guages; a very good exposition of the theory, pioneered by Chomski and Schutzen- 
berger [6], can be found in [9, 34]. Here we give only a small example of use of 
this machinery, motivated by applications of our methods to study of complex- 
ity of algorithms on amalgamated products of free groups [3] . We do not give 
rather technical and lengthy definitions related to context-free languages which 
can be found in [28] (see also [38] for a compact formal definition). 

Let M be the free monoid generated by X U X~^. We call two subsets 
R,ScM isobaric if, for every fc, they contain equal number of words of length 
k, that is, if the have the same generating function. 

Example 4. Let X = Y U Z, where \Y\ = I ^ 1 and \Z\ = n ^ 1. We 
shall find the measure of the set 




R=\J FiY)^" 



of all elements in F conjugate to non- identity elements in F{Y). Here, 
as usually, we denote by F(y)'' the set of non-identity elements of F{Y). 
Obviously, we can decompose 



R = F{Y)>u[jF{Yf, 
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where the union is taken over aU elements in F which start with letters 
in Z U . The generating function for F{Y)^ is obvious: 

CO 

fit) = ^ 2^(2/ - if -^t* 

k=l 

= 2Z-t(l-(2Z-l)f)"\ 

Denote Li = F{Y)^ and let L2 be the cone of words in F which start 
from symbols in ZuZ^^ . It is easy to see that, in the free monoid M{X), 
the language L^^ o Li o L2 is isobaric to to the language L\o L\, where 
L| = {/ o / I / e L2} and o denotes formal product in M{X) without 
cancellation. The generating function of L2 is 
00 

git) = 2n(2m - 1)'="^*'= = 2nt(l - (2m - 

fc=i 

and the generating function for L| is g{t^). According to the standard 
rules of computation of generating functions for context-free languages 
[34], the generating function for Li U Li o L% is 

fit) + f{t)9{t^) = 2' • t{l - {21 - l)t)~^{l + 2nf{l - (2m - l)f)~^) 
and therefore 

2m -1 1 - s / , , l-s\-i 

KR) = s • 21 ■ 1 - 2; - 1 . 

2m 2m — 1 V 2m — 1 / 

m V 2m - ly V (2m - 1)2 y V 2m - 1 J 



Notice that by [21], a normal subgroup iV o F is context-free if and only if 
the factor group F/N is free-by-finite. 

5 Addendum: A Tauberian Theorem by Hardy 
and Littlewood 

We found ourselves in the context where generalised summation methods for 
series are essential. 

Example 5. Consider the subgroup H of index 2 in _F which consists of 
all words of even length in F. Let Uk be the number of elements of length 
k in H and 

„ ^ rik_ 
" \Sk\ 

be the relative frequency of elements of length k from H among all ele- 
ments of length k in F. Obviously, 

f _ j i if A; is even 
~ \ if fc is odd ■ 
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One can easily see that 

oo 

s(l-sf 



1 - (1 - sf 



2^ A.^ 8 "^16"^"" 



When s — > 0+, ii{H) — > 1/2. 

To explain the rather expected appearance of 1/2 as the 'Umit probability' of 
the subgroup H in Example 5, we need to invoke one of the so-called Tauberian 
theorems by Hardy and Littlewood. 

Theorem 5.1 [13, Theorems 94] Let {o„} he a sequence of real numbers such 
that the sequence of partial sums 

Sn = 0,0 -\- " ' -\- o,n 

is bounded from below. Assume also that the limit {the Abelian sum o/{a„}) 



lim y^ttfeX* 



fe=o 

exists and equals S. Then the sequence {a„} is Cesaro summable in the sense 
that the limit 

Sq + Si + ■ ■ ■ + Sn 

lim 

n— >(X) n + 1 

exists and equal S. 

Corollary 5.2 Assuiv.e that the sequence {/„} of non-negative real numbers is 
bounded and the sum X^fc^o fki^ ~ s)'^ converges for all < s < 1. Assume, in 
addition, that there exists the limit 



lim sV/fe(l -«)'== MO- 

s— ►0+ — 

Then 



fe=0 



= lim /o + /i + • • • + /fc 

/c->oo A; + 1 



Proof. Set t = 1 — s and rewrite 

oo oo 

s^f,{l-s)' = {l-t)J2f,t'^ 

fe=0 k=0 

= /0 + (/l-/0)t + (/2-/l)i' + ---, 
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and the series on the right converges for all < t < 1. Moreover, 
^lim (^fo + Y^ifk - j = MO- 

Since the partial sums 

/o + (/i - /o) + • • • + ifk - fk-i) = fk 
are bounded from below, the previous theorem yields 

[/o] + [/o + (/i - /o)] + • • • + [/o + (/i - /o) + • • • + ifk - fk-l)] 

= k^To. fcTI 

= lim^° + V-- + ^\ 

fe— >oo fc + 1 

□ 

Corollary 5.2 explains, in particular, that for our subgroup H of index 2 in 
F, /u(-ff) = Mo + where 

^. 1 + + --- + 1 + 1 

Mo = lim = -. 

n— >(x) 2n 2 

The following corollary is an easy consequence of Corollary 5.2 and Theo- 
rem 5.1. 

Corollciry 5.3 Assume that the sequence {fk} of non-negative real numbers is 
bounded, the sum J2T=ofk{^ " ^)'' convergent for all Q < s < 1 and the 
function 

oo 

ti{s) = sY,fk{l-sf 

fe=0 

has the limit 



lim m(s) = HQ. 

s— ^0+ 



Then 

(a) Ho is the Cesaro limit 

= lim /o + /i + • • • + /fc 
fe^oo k + 1 

(b) If Ho =0 and the limit 

Hi =■ lim 

s^0+ s 

exists then the sum ^ fk is convergent and 

oo 

Ml = ^fk- 

k=l 
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(c) // the series ^ fk converges, then no and ni exist, no = and 



fc=o 

(d) In particular, if the function /x(s) is analytic in the vicinity of and 
regular at s = 0, then, in the power series expansion at s = 0, 

= lim /o + /i + • • • + /fc 

fe->oo A; + 1 ' 

and, if fio = 0, the next coefficient is given by 



fc=i 



Proof, (a) directly follows from 5.2. 

For a proof of (b), notice that by Theorem 5.1 



oo 



Ml = ^fk, 
fc=l 

where the sum ^ is understood in the sense of the Cesaro hmit of the 
partial sums Sk = fi -\ -\- fk' 

oo - 

''V/,= lim + + 

^-^ 71— >c!o n 
k=l 

But for non-negative series, Cesaro summability is equivalent to the ordinary 
convergence, which yields the result. 

For (c), assume that the series ^ fk converges. By Abel's theorem on con- 
tinuity of sums of power series, the function /(s) is continuous on the interval 
[0, 1) and hence 

oo 

III = lim /(s) = V fk 

and 



fe=0 



Ho = lim sf{s) = 0. 
(d) is an immediate corollary of (a)- and (b). □ 
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